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1 TACTAAAGGG AACAAAAGCT GGAGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA 
ATGATTTCCC TTGTTTTCGA CCTCGAGGTG GCGCCACCGC CGGCGAGATC TTGATCACCT 

5' UTR 



TCCCCCGGGC TGCAGGAATT CGGCACGAGG AACTTTCTGC CTCGTTTTTT TGCTCCTACT 
AGGGGGCCCG ACGTCCTTAA GCCGTGCTCC TTGAAAGACG GAGCAAAAAA ACGAGGATGA 
5* UTR SEQ ID NO: 3 



MS Q E I V QSG QTY 
GTTTTTCTCT TCCAGTTTCT ACCATGTCGC AAGAAATTGT TCAATCAGGA CAAACCTACA 
CAAAAAGAGA AGGTCAAAGA TGGTACAGCG TTCTTTAACA AGTTAGTCCT GTTTGGATGT 
SEQ ID NO: 3 



IITN AKS GTV VDLS GED N K S 

TCATCACTAA CGCCAAATCC GGCACAGTTG TTGACCTTTC GGGCGAAGAC AACAAATCTA 

AGTAGTGATT GCGGTTTAGG CCGTGTCAAC AACTGGAAAG CCCGCTTCTG TTGTTTAGAT 

IIGF PKH GGT NQRW TLN WTG 

TTATTGGATT TCCCAAGCAT GGAGGAACAA ATCAGAGGTG GACCCTCAAC TGGACAGGGA 

AATAACCTAA AGGGTTCGTA CCTCCTTGTT TAGTCTCCAC CTGGGAGTTG ACCTGTCCCT 

SEQ ID NO: 5 



KSWT FRS VSS EMYL GLN GSP 
AGAGTTGGAC TTTCCGCTCC GTTTCTTCTG AAATGTATCT TGGCCTGAAT GGCTCGCCGT 
TCTCAACCTG AAAGGCGAGG CAAAGAAGAC TTTACATAGA ACCGGACTTA CCGAGCGGCA 

SEQ ID NO: 4 (partial) 



SEQ ID NO: 5 



SEQ ID NO: 6 (partial) 



SDGT KLV A V T TPVE WRI WH 
CTGATGGAAC AAAACTGGTA GCCGTGACCA CCCCTGTTGA GTGGCGCATC TGGCACGA 418 
GACTACCTTG TTTTGACCAT CGGCACTGGT GGGGACAACT CACCGCGTAG ACCGTGCT 
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5' UTR 



START 
MS Q E I 

1 GCCTCGTTTT TTTGCTCCTA CTGTTTTTCT CTTCCAGTTT CTACCATGTC GCAAGAAATT 
CGGAGCAAAA AAACGAGGAT GACAAAAAGA GAAGGTCAAA GATGGTACAG CGTTCTTTAA 

VQS GQTY IIT NAK SGTV VDL 
61 GTTCAATCAG GACAAACCTA CATCATCACT AACGCCAAAT CCGGCACAGT TGTTGACCTT 
CAAGTTAGTC CTGTTTGGAT GTAGTAGTGA TTGCGGTTTA GGCCGTGTCA ACAACTGGAA 

SGE DNKS IIG FPK HGGT NQR 
121 TCGGGCGAAG ACAACAAATC TATTATTGGA TTTCCCAAGC ATGGAGGAAC AAATCAGAGG 
AGCCCGCTTC TGTTGTTTAG ATAATAACCT AAAGGGTTCG TACCTCCTTG TTTAGTCTCC 

WTL NWTG KSW 
181 TGGACCCTCA ACTGGACAGG GAAGAGTTGG A 211 
ACCTGGGAGT TGACCTGTCC CTTCTCAACC T 
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VDL SGE DNK S I I G FPK HGG 
1 TTGTTGACCT TTCGGGCGAA GACAACAAAT CTATTATTGG ATTTCCCAAG CATGGAGGAA 
AACAACTGGA AAGCCCGCTT CTGTTGTTTA GATAATAACC TAAAGGGTTC GTACCTCCTT 

TNQR WTL NWT GKSW TFR SVS 
61 CAAATCAGAG GTGGACCCTC AACTGGACAG GGAAGAGTTG GACTTTCCGC TCCGTTTCTT 
GTTTAGTCTC CACCTGGGAG TTGACCTGTC CCTTCTCAAC CTGAAAGGCG AGGCAAAGAA 

SEMY LGL NGS PSDG TKL VAV 
121 CTGAAATGTA TCTTGGCCTG AATGGCTCGC CGTCTGATGG AACAAAACTG GTAGCCGTGA 
GACTTTACAT AGAACCGGAC TTACCGAGCG GCAGACTACC TTGTTTTGAC CATCGGCACT 

TTPV EWH IWH D E V D PST YRI 
181 CCACCCCTGT TGAGTGGCAC ATCTGGCACG ACGAAGTTGA CCCTTCAACT f ATCGTATCT 
GGTGGGGACA ACTCACCGTG TAGACCGTGC TGCTTCAACT GGGAAGTTGA ATAGCATAGA 

A/G polymorphism 

FVPF TTF NMD LYAQ GSA APG 
241 TTGTACCTTT CACCACATTC AACATGGACC TCTACGCCCA RGGTAGTGCC GCCCCTGGTA 
AACATGGAAA GTGGTGTAAG TTGTACCTGG AGATGCGGGT YCCATCACGG CGGGGACCAT 

T/C polymorphism 

TPIT TWY TWK GIHQ TWR FEL 
301 CGCCTATCAC AACTTGGTAT ACATGGAAGG GYATCCACCA AACGTGGAGG TTTGAACTAG 
GCGGATAGTG TTGAACCATA TGTACCTTCC CRTAGGTGGT TTGCACCTCC AAACTTGATC 

T/G polymorphism 

STOP 

3' UTR 

A * 

361 CTTAGGKTCA GGTTTCGGAT GTAATTTGTG TGTGTAAATC TTCTTGGACC ATGTTGTGCT 
GAATCCMAGT CCAAAGCCTA CATTAAACAC ACACATTTAG AAGAACCTGG TACAACACGA 

3' UTR 

4 21 TTTATTGTAC TCCGCTTGTT ATCATTATAC CCACCTATGT TGCAACATCT TTTTGGATCC 
AAATAACATG AGGCGAACAA TAGTAATATG GGTGGATACA ACGTTGTAGA AAAACCTAGG 
PolyA tail 



3' UTR 

481 CAAAAAAAAA AAA 493 
GTTTTTTTTT TTT 
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START 



M SQE IVQ S G 0 T Y I I 
1 TCTCTTCCAG TTTCTACCAT GTCGCAAGAA ATTGTTCAAT CAGGACAAAC CTACATCATC 
AGAGAAGGTC AAAGATGGTA CAGCGTTCTT TAACAAGTTA GTCCTGTTTG GATGTAGTAG 

TNA KSGT VVD LSG EDNK SII 
61 ACTAACGCCA AATCCGGCAC AGTTGTTGAC CTTTCGGGCG AAGACAACAA ATCTATTATT 
TGATTGCGGT TTAGGCCGTG TCAACAACTG GAAAGCCCGC TTCTGTTGTT TAGATAATAA 

GFP KHGG TNQ RWT LNWT GKS 
121 GGATTTCCCA AGCATGGAGG AACAAATCAG AGGTGGACCC TCAACTGGAC AGGGAAGAGT 
CCTAAAGGGT TCGTACCTCC TTGTTTAGTC TCCACCTGGG AGTTGACCTG TCCCTTCTCA 

WTF RSVS SEM YLG LNGS PSD 
181 TGGACTTTCC GCTCCGTTTC TTCTGAAATG TATCTTGGCC TGAATGGCTC GCCGTCTGAT 
ACCTGAAAGG CGAGGCAAAG AAGACTTTAC ATAGAACCGG ACTTACCGAG CGGCAGACTA 

GTK LVAV TTP VEW HIWH DEV 
241 GGAACAAAAC TGGTAGCCGT GACCACCCCT GTTGAGTGGC ACATCTGGCA CGACGAAGTT 
CCTTGTTTTG ACCATCGGCA CTGGTGGGGA CAACTCACCG TGTAGACCGT GCTGCTTCAA 

DPS TYRI FVP FTT FNMD LYA 
30 1 GACCCTTCAA CTTATCGTAT CTTTGTACCT TTCACCACAT TCAACATGGA CCTCTACGCC 
CTGGGAAGTT GAATAGCATA GAAACATGGA AAGTGGTGTA AGTTGTACCT GGAGATGCGG 

A/G polymorphism C/T polymorphism 

QGS AAPG TPI TTW YTWK GIH 
361 CAAGGTAGTG CCGCCCCTGG TACGCCTATC ACAACTTGGT ATACATGGAA GGGCATCCAC 
GTTCCATCAC GGCGGGGACC ATGCGGATAG TGTTGAACCA TATGTACCTT CCCGTAGGTG 



G/T polymorphism 



STOP 



421 



Q T W R F E L 
CAAACGTGGA GGTTTGAACT 
GTTTGCACCT CCAAACTTGA 



A * 

AGCTTAGGGT CAGGTTTCGG ATGTAATTTG T 4 91 
TCGAATCCCA GTCCAAAGCC TACATTAAAC A 
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STOP 




exon 1 




intron 1 



intron 2 



intron 3 



intron 4 
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FIG. 6 

START 

exon 1 



M SQE IVQ SGQT YII 

1 TCTCTTCCAG TTTCTACCAT GTCGCAAGAA ATTGTTCAAT CAGGACAAAC CTACATCATC 

AGAGAAGGTC AAAGATGGTA CAGCGTTCTT TAACAAGTTA GTCCTGTTTG GATGTAGTAG 

exon 1 



intron 1 



TNA KSGT VVD LSG EDNK S 

61 ACTAACGCCA AATCGGGCAC AGTTGTTGAC CTTTCGGGCG AAGACAACAA ATCTAGTAAG 

TGATTGCGGT TTAGGCCGTG TCAACAACTG GAAAGCCCGC TTCTGTTGTT TAGATCATTC 

intron 1 



121 TCGTTTTTAG TCCCATGTTT TTTTTTGTCA AAAAAAATTG ACTGACATAT TTTGTCTCCA 
AGCAAAAATC AGGGTACAAA AAAAAACAGT TTTTTTTAAC TGACTGTATA AAACAGAGGT 

exon 2 



intron 1 intron 2 



IG FPKH GGT NQR 
181 GTTATTGGAT TTCCCAAGCA TGGAGGAACA AATCAGAGGG TAGGTCTAGA AATGCACCTC 
CAATAACCTA AAGGGTTCGT ACCTCCTTGT TTAGTCTCCC ATCCAGATCT TTACGTGGAG 

exon 3 



intron 2 



WT LNW TGKS 

241 GTTAATATTG GTTTTTATTG ACATTCATGA ACAGTGGACC CTCAACTGGA CAGGGAAGAG 

CAATTATAAC CAAAAATAAC TGTAAGTACT TGTCACCTGG GAGTTGACCT GTCCCTTCTC 

exon 3 



WTF RSV SSEM YLG LNG SPSD 
301 TTGGACTTTC CGCTCCGTTT CTTCTGAAAT GTATCTTGGC CTGAATGGCT CGCCGTCTGA 
AACCTGAAAG GCGAGGCAAA GAAGACTTTA CATAGAACCG GACTTACCGA GCGGCAGACT 

exon 3 



GTK LVA VTTP VEW HIW HDEV 
361 TGGAACAAAA CTGGTAGCCG TGACCACCCC TGTTGAGTGG CACATCTGGC ACGACGAAGT 
ACCTTGTTTT GACCATCGGC ACTGGTGGGG ACAACTCACC GTGTAGACCG TGCTGCTTCA 
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exon 3 intron 3 



DPS T Y 

4 21 TGACCCTTCA ACTTATCGGT GAGTCCCCTA AATATTACTT GCTTGTGGTT CATACTAATA 
ACTGGGAAGT TGAATAGCCA CTCAGGGGAT TTATAATGAA CGAACACCAA GTATGATTAT 

intron 3 exon 4 



IF VPFT TFN MDL YAQG 

481 CGTCGTTCGA AGTATCTTTG TACCTTTCAC CACATTCAAC ATGGACCTCT ACGCCCAGGG 

GCAGCAAGCT TCATAGAAAC ATGGAAAGTG GTGTAAGTTG TACCTGGAGA TGCGGGTCCC 

exon 4 



SAA PGT PITT WYT WKG IHQT 
541 TAGTGCCGCC CCTGGTACGC CTATCACAAC TTGGTATACA TGGAAGGGTA TCCACCAAAC 
ATCACGGCGG GGACCATGCG GATAGTGTTG AACCATATGT ACCTTCCCAT AGGTGGTTTG 

intron 4 



exon 4 



W R F EL 

601 GTGGAGGTTT GAACTAGGTA GGGCTTGCGA TCTCACCCGG ATCCTCCATG AACTAATGTG 
CACCTCCAAA CTTGATCCAT CCCGAACGCT AGAGTGGGCC TAGGAGGTAC TTGATTACAC 

intron 4 STOP 



661 ATCACGTCGT GTTCTAGCTT AGGTTCAGGT TTCGGATGTA ATTTGT 706 
TAGTGCAGCA CAAGATCGAA TCCAAGTCCA AAGCCTACAT TAAACA 
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